# End-to-End Segmentation
Coco Panoptic Eomt Giant 640
MIT
This model demonstrates the potential of the Vision Transformer (ViT) for image segmentation tasks.
Image Segmentation
tue-mps
92
0
Segmentation
MIT
An end-to-end speaker segmentation model for voice activity detection, overlapped speech detection, and resegmentation.
Audio Processing
TensorBoard

salmanshahid
1,790
0
Vad
MIT
A voice activity detection model based on pyannote.audio, used to identify active speech segments in audio.
Speech Recognition
salmanshahid
1,794
1
Featured Recommended AI Models